Modeling Reduced Pronunciations in German
Authors: Adda-Decker & Lamel
Abstract
This paper deals with pronunciation modeling for automatic speech recognition in German, with a special focus on reduced pronunciations. Starting from our 65k full-form pronunciation dictionary, we experimented with different phone sets for pronunciation modeling. For each phone set, different lexica were derived using mapping rules for unstressed syllables, in which a schwa followed by [l n m] is replaced by syllabic [l n m]. The different pronunciation dictionaries are used both for acoustic model training and during recognition. The speech corpora consisted of television programmes, which contain signal segments of a varying acoustic and linguistic nature. The speech is produced by a wide variety of speakers, with linguistic styles ranging from prepared to spontaneous speech and with changing background and channel conditions. Experiments were carried out using 4 news programmes and documentaries lasting more than 15 minutes each (1h20min in total). The word error rates obtained vary between 19% and 29%, depending on the programme and the system configuration. Only small differences in recognition rates were measured across the experimental setups, with slightly better results obtained with the reduced lexica.
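The mapping rule described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the SAMPA-style symbols (`@` for schwa, `l=`, `n=`, `m=` for syllabic consonants), the function names, and the two toy lexicon entries are all assumptions made for illustration. The sketch also treats every schwa as unstressed and does not check whether the consonant begins the following syllable, which a real rule set would need to handle.

```python
# Hypothetical sketch of deriving a reduced lexicon from a full-form lexicon.
# Phone symbols are assumed SAMPA-style; the paper's actual phone sets may differ.
SCHWA = "@"
SYLLABIC = {"l": "l=", "n": "n=", "m": "m="}


def reduce_pronunciation(phones):
    """Replace each schwa + [l n m] pair with the matching syllabic consonant."""
    reduced = []
    i = 0
    while i < len(phones):
        following = phones[i + 1] if i + 1 < len(phones) else None
        if phones[i] == SCHWA and following in SYLLABIC:
            reduced.append(SYLLABIC[following])
            i += 2  # the schwa and the consonant are consumed together
        else:
            reduced.append(phones[i])
            i += 1
    return reduced


def derive_reduced_lexicon(lexicon):
    """Apply the reduction rule to every entry of a word -> phone-list lexicon."""
    return {word: reduce_pronunciation(phones) for word, phones in lexicon.items()}


# Toy full-form entries, invented for this example.
full_lexicon = {
    "haben": ["h", "a:", "b", "@", "n"],
    "Himmel": ["h", "I", "m", "@", "l"],
}
reduced_lexicon = derive_reduced_lexicon(full_lexicon)
print(reduced_lexicon)
```

Keeping the full and reduced lexica as separate dictionaries mirrors the setup in the abstract, where each derived lexicon is used both for acoustic model training and during recognition.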
Related resources
Investigating text normalization and pronunciation variants for German broadcast transcription
In this paper we describe our ongoing work concerning lexical modeling in the LIMSI broadcast transcription system for German. Lexical decomposition is investigated with a twofold goal: lexical coverage optimization and improved letter-to-sound conversion. A set of about 450 decompounding rules, developed using statistics from a 300M word corpus, reduces the OOV rate from 4.5% to 4.0% on a 30k ...
How are words reduced in spontaneous speech?
Words are reduced in spontaneous speech. If reductions are constrained by functional (i.e., perception and production) constraints, they should not be arbitrary. This hypothesis was tested by examining the pronunciations of high- to mid-frequency words in a Dutch and a German spontaneous speech corpus. In logistic-regression models the "reduction likelihood" of a phoneme was predicted by fixed-effe...
Using acoustic models to choose pronunciation variations for synthetic voices
Within-speaker pronunciation variation is a well-known phenomenon; however, attempting to capture and predict a speaker's choice of pronunciations has been mostly overlooked in the field of speech synthesis. We propose a method to utilize acoustic modeling techniques from speech recognition in order to detect a speaker's choice between full and reduced pronunciations.
Automatic detection of anglicisms for the pronunciation dictionary generation: a case study on our German IT corpus
With globalization, more and more words from other languages enter a language without being assimilated to its phonetic system. To economically build up lexical resources with automatic or semi-automatic methods, it is important to detect and treat them separately. Due to the strong increase of Anglicisms, especially from the IT domain, we developed features for their auto...
A Neurophysiological Investigation of Non-native Phoneme Perception by Dutch and German Listeners
The Mismatch Negativity (MMN) response has often been used to measure memory traces for phonological representations and to show effects of long-term native language (L1) experience on neural organization. We know little about whether phonological representations of non-native (L2) phonemes are modulated by experience with distinct non-native accents. We used MMN to examine effects of experienc...
Publication date: 2001